Comparison of De Novo Transcriptome Assemblers and k-mer Strategies Using the Killifish, Fundulus heteroclitus.

نویسندگان

  • Satshil B Rana
  • Frank J Zadlock
  • Ziping Zhang
  • Wyatt R Murphy
  • Carolyn S Bentivegna
چکیده

BACKGROUND De novo assembly of non-model organism's transcriptomes has recently been on the rise in concert with the number of de novo transcriptome assembly software programs. There is a knowledge gap as to what assembler software or k-mer strategy is best for construction of an optimal de novo assembly. Additionally, there is a lack of consensus on which evaluation metrics should be used to assess the quality of de novo transcriptome assemblies. RESULT Six different assembly strategies were evaluated from four different assemblers. The Trinity assembly was used in its default 25 single k-mer value while Bridger, Oases, and SOAPdenovo-Trans were performed with multiple k-mer strategies. Bridger, Oases, and SOAPdenovo-Trans used a small multiple k-mer (SMK) strategy consisting of the k-mer lengths of 21, 25, 27, 29, 31, and 33. Additionally, Oases and SOAPdenovo-Trans were performed using a large multiple k-mer (LMK) strategy consisting of k-mer lengths of 25, 35, 45, 55, 65, 75, and 85. Eleven metrics were used to evaluate each assembly strategy including three genome related evaluation metrics (contig number, N50 length, Contigs >1 kb, reads) and eight transcriptome evaluation metrics (mapped back to transcripts (RMBT), number of full length transcripts, number of open reading frames, Detonate RSEM-EVAL score, and percent alignment to the southern platyfish, Amazon molly, BUSCO and CEGMA databases). The assembly strategy that performed the best, that is it was within the top three of each evaluation metric, was the Bridger assembly (10 of 11) followed by the Oases SMK assembly (8 of 11), the Oases LMK assembly (6 of 11), the Trinity assembly (4 of 11), the SOAP LMK assembly (4 of 11), and the SOAP SMK assembly (3 of 11). CONCLUSION This study provides an in-depth multi k-mer strategy investigation concluding that the assembler itself had a greater impact than k-mer size regardless of the strategy employed. Additionally, the comprehensive performance transcriptome evaluation metrics utilized in this study identified the need for choosing metrics centered on user defined research goals. Based on the evaluation metrics performed, the Bridger assembly was able to construct the best assembly of the testis transcriptome in Fundulus heteroclitus.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Clustering of Short Read Sequences for de novo Transcriptome Assembly

Given the importance of transcriptome analysis in various biological studies and considering thevast amount of whole transcriptome sequencing data, it seems necessary to develop analgorithm to assemble transcriptome data. In this study we propose an algorithm fortranscriptome assembly in the absence of a reference genome. First, the contiguous sequencesare generated using de Bruijn graph with d...

متن کامل

In vitro effects of prolactin, cortisol, aldosterone, and cyclic amp on branchial Na+, K+ - activated ATPase of the killifish, Fundulus heteroclitus

!!! vitro effects of prolactin, cortisol, aldosterone, and cyclic ad.enosine monophosphate were studied on branchial Na+, K+ activated ATPase of freshwater adapted killifish, Fundulus heteroclitus. Decreases in ATPase activity were found after treatment with prolactin and aldosterone. Cyclic AMP caused an increase in ATPase activity. The action of cortisol on this enzyme was not clear. A dose r...

متن کامل

A novel aquaporin 3 in killifish (Fundulus heteroclitus) is not an arsenic channel.

The Atlantic killifish (Fundulus heteroclitus) is a model environmental organism that has an extremely low assimilation rate of environmental arsenic. As a first step in elucidating the mechanism behind this phenomenon, we used quantitative real-time PCR to identify aquaglyceroporins (AQPs), which are arsenite transporters, in the killifish gill. A novel homolog killifish AQP3 (kfAQP3a) was clo...

متن کامل

Evolution of tolerance to PCBs and susceptibility to a bacterial pathogen (Vibrio harveyi) in Atlantic killifish (Fundulus heteroclitus) from New Bedford (MA, USA) harbor.

A population of the non-migratory estuarine fish Fundulus heteroclitus (Atlantic killifish) resident to New Bedford (NB), Massachusetts, USA, an urban harbor highly contaminated with polychlorinated biphenyls (PCBs), demonstrates recently evolved tolerance to some aspects of PCB toxicity. PCB toxicology, ecological theory, and some precedence supported expectations of increased susceptibility t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • PloS one

دوره 11 4  شماره 

صفحات  -

تاریخ انتشار 2016